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DATE: 09/25/97 
TIME: 12:34:44 



INPUT SET: S20S69.mw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



(1) 



General Information: 



(i) APPLICANT: Choo, Yen 

Klug, Aaron 

Sanchez Garcia, Isidro 

(ii) TITLE OF INVENTION: Improvements in or Relating to 
Binding Proteins for Recognition of DNA 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pillsbury Madison & Sutro, L.L.P. 

(B) STREET: 1100 New York Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20005-3918 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Word Perfect 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/GB95/01949 

(B) FILING DATE: 17-AUG-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9514698.1 

(B) FILING DATE: 18-JUL-1995 




NOV o 4 1997, 

c.nUUP 1800 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9422534.9 

(B) FILING DATE: 08-NOV-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9416880.4 
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INPUT SET: S20569.mw 





47 




(B) FILING 


DATE: 20-AUG-I994 ?• 






48 


















49 


(2) 


INFORMATION FOR 


SEQ 


ID NO: 1: 








50 


















51 




(i) SEQUENCE CHARACTERISTICS: 








52 




(A) LENGTH: 60 


base pairs 








53 




(B) TYPE: 


nucleic 


acid 








54 




(C) STRANDEDNESS : 


single 








55 




(D) TOPOLOGY: 


linear 








56 


















57 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


1 : 




58 


















59 


CTCCTGCAGT TGGACCTGTG CCATGGCCGG CTGGGCCGCA 1 




60 


CAACTAAAGC 














61 


















62 




INFORMATION FOR 


SEQ 


ID NO: 2: 








63 


















64 




(i) SEQUENCE CHARACTERISTICS: 








65 




(A) LENGTH 


: 92 


amino acids 








66 




(B) TYPE: 


amino acid 








67 




(C) STRANDEDNESS: 










68 




(D) TOPOLOGY: 


unknown 








69 


















70 




(ii) MOLECULE TYPE: 


protein 








71 


















72 


















73 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 


£t « 




74 


















75 




Met Ala Glu Glu 


Arg 


Pro 


Tyr Ala 


Cys 






76 






5 








10 




77 


















78 




Val Glu Ser Cys 


Asp 


Arg 


Arg Phe 


Ser 


Arg 




79 






15 








20 




80 


















81 




Ser Asp Glu Leu 


Thr 


Arg 


His He 


Arg 


He 




82 






25 








30 




83 


















84 




His Thr Gly Gin 


Lys 


Pro 


Phe Gin 


Cys 


Arg 




85 






35 








40 




86 


















87 




lie Cys Met Arg 


Asn 


Phe 


Ser Xaa 


Xaa 


Xaa 




88 






45 








50 




89 


















90 




Xaa Leu Xaa Xaa 


His 


Xaa 


Arg Thr 


His 


Thr 




91 






55 








60 




92 


















93 


o 


Gly Glu Lys Pro 


Phe 


Ala 


Cys Asp 


He 


Cys 




94 






65 








70 


^ 95 


















96 




Gly Arg Lys Phe 


Ala 


Arg 


Ser Asp 


Glu 


Arg 


» 

« 


97 


c 




75 








80 


i 
i 


98 
















i 


99 




Lys Arg His Thr 


Lys 


He 


His Leu 


Arg 


Gin 
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85 90 

Lys Asp 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TATGACTTGG ATGGGAGACC GCCTGG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATTCCAGGC GGTCTCCCAT CCAAGTCA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TATATAGCGT GGGCGTATAT A 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY j linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCGTATATAC GCCCACGCTA #ATA 

b 

(2) INFORMATION FOR SF,Q ID NO: 7: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TATATAGCGN NNGCGTATAT A 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGTATATAC GCNNNCGCTA TATA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTCCATGGAG ACGCAGAAGC CCTTCAGCGG CCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 i 
TTCCATGGAG ACGCAGGTGA GTTCCT8ACG CCA 
(2) INFORMATION FOR SEQ<jEED NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
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■ ry --0 INPUT SET: S20569.mw 

206 (C) STRANDEDNESStv's'lngie.'' ; 

207 (D) TOPOLOGY: linear . \ 
208 

209 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

210 

211 CCCCTTTCTC TTCCAGAAGC CCTTCAGCGG CCA 3 3 

212 

213 (2) INFORMATION FOR SEQ ID NO: 12: 
214 

215 (i) SEQUENCE CHARACTERISTICS: 

216 (A) LENGTH: 33 amino acids 

217 (B) TYPE: amino acid 

218 (C) STRANDEDNESS : 

219 (D) TOPOLOGY: unknown 
220 

221 (ii) MOLECULE TYPE: peptide 

222 

223 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

224 

225 Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

226 5 10 
227 

228 lie Cys Met Arg Asn Phe Ser Asp Arg Ser 

229 15 20 
230 

231 Ser Leu Thr Arg His Thr Arg His Thr Gly 

232 25 30 
233 

234 Glu Lys Pro 

235 

2 36 (2) INFORMATION FOR SEQ ID NO: 13: 
237 

238 (i) SEQUENCE CHARACTERISTICS: 

239 (A) LENGTH: 33 amino acids 

240 (B) TYPE: amino acid 

241 (C) STRANDEDNESS: 

242 (D) TOPOLOGY: unknown 
243 

244 (ii) MOLECULE TYPE: peptide 

245 

246 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

247 

248 Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

249 5 10 
250 

251 lie Cys Met Arg Asn Phe Ser Glu Arg Gly 

o 252 15 20 * « 

253 ; 

254 Thr Leu Ala Arg His Glu Lys His Thr Gly ^ 

255 25 30 I 
.256 I 

257 Glu Lys Pro i 

258 i 
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Line 



Error 



Original Text 




